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ABSTRACT 

An  algorithm  for  calculating  univariate  LI  spline  fits  that  involves  multiple  minimizations  of  the  spline  functional  on  each  iteration  was 
created.  The  computational  results  for  this  algorithm  indicate  overall  good  perfonnance  of  the  procedure  but  the  procedure  is 
computationally  more  expensive  than  desired.  We  formulated  a  new  potential  algorithm  in  which  there  will  be  only  one  minimization  of  the 
spline  functional  on  each  iteration.  We  also  continued  development  of  a  new  LI  “Multiple  Component  Detection  and  Analysis”  (LI 
MCDA)  algorithm,  which  is  a  fundamental  and  complete  reformulation  of  Principal  Component  Analysis  in  a  framework  exclusively  based 
on  the  LI  nonn.  Direct  connection  with  heavy-tailed  statistics  is  a  guiding  principle.  We  completed  design  of  and  computational  results  for 
the  2D  case.  The  extension  of  LI  MCDA  to  3D  is  currently  under  way. 
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Scientific  Progress 


The  research  was  on  LI  splines  and  LI  MCDA  (Multiple  Component  Detection  and  Analysis)  in  the  following  directions: 


LI  SPLINES:  The  LI  splines  under  consideration  are  all  locally  calculated  LI  splines.  The  focus  shifted  from  univariate 
interpolating  splines,  which  had  been  the  topic  of  investigation  until  summer  201 1,  to  univariate  approximating  splines.  Two 
options  for  approximating  splines  were  considered,  namely,  smoothing  splines  and  spline  fits.  Smoothing  splines  require  the 
user  to  choose  a  balance  parameter  (parameter  that  determines  the  balance  between  how  closely  the  data  is  fit  and  how 
smooth  the  spline  is).  However,  there  is  no  theoretical  or  empirical  guidance  for  how  to  choose  the  balance  parameter.  In 
contrast  to  smoothing  splines,  spline  fits  do  not  involve  a  balance  parameter  or  any  other  parameters  that  the  user  is  required  to 
choose.  For  this  reason,  it  was  decided  not  to  proceed  with  smoothing  splines  but  rather  to  use  spline  fits.  An  algorithm  for 
calculating  univariate  LI  spline  fits  that  involves  multiple  minimizations  of  the  spline  functional  on  each  iteration  was  created. 
The  computational  results  for  this  algorithm  indicate  overall  good  performance  of  the  procedure.  However,  we  were  not  able  to 
identify  specific  advantages  vs.  previously  available  LI  spline  fits  calculated  using  an  interior-point  algorithm  developed 
previously  in  2004.  Moreover,  the  extension  of  this  algorithm  to  higher  dimensions  will  be  computationally  unattractive  because 
the  number  of  minimizations  of  the  spline  functional  required  is  proportional  to  a  constant,  for  example,  1 5  to  the  dth  power  for 
d-dimensional  LI  spline  fits.  For  this  reason,  we  have  formulated  a  new  algorithm  in  which  there  will  be  only  one  minimization  of 
the  spline  functional  on  each  iteration.  This  algorithm  is  a  steepest-descent  algorithm  to  minimize  a  global  data-fitting  functional 
under  a  constraint  implemented  by  a  local 

analysis-based  interpolating-spline  algorithm  on  5-node  windows.  Comparison  of  these  locally  calculated  LI  spline  fits  with 
globally  calculated  LI  spline  fits  previously  reported  in  the  literature  indicates  that  the  locally  calculated  spline  fits  preserve 
shape  on  the  average  slightly  better  than  the  globally  calculated  spline  fits  and  are  computationally  more  efficient  because  the 
locally-calculated-spline-fit  algorithm  can  be  parallelized. 

LI  MCDA:  We  continued  development  of  a  new  LI  “Multiple  Component  Detection  and  Analysis”  (LI  MCDA)  algorithm.  To 
properly  distinguish  this  algorithm  from  classical  PCA  (Principal  Component  Analysis)  and  robust  PCAs,  we  changed  its  name 
from  the  previous  name  LI  PCA  to  LI  MCDA.  LI  MCDA  is  a  fundamental  and  complete  reformulation  of  PCA  in  a  framework 
exclusively  based  on  the  LI  norm.  Direct  connection  with  heavy-tailed  statistics  is  a  guiding  principle.  We  completed  design  of 
and  computational  results  for  the  2D  case  and  submitted  a  manuscript  on  this  case.  LI  MCDA  is  able  to  determine  the  main 
directions  and  the  radial  extent  of  2D  data  from  Gaussian  and  heavy-tailed  distributions  without  and  with  patterned  artificial 
outliers  (clutter)  as  well  as  from  distributions  consisting  of  multiple  superimposed  Gaussian  and  heavy-tailed  distributions 
without  and  with  such  outliers.  Computational  results  indicate  that  2D  LI  MCDA  is  in  nearly  all  cases  superior  in  accuracy  to  the 
robust  PCA  of  Croux  and  Ruiz-Gazen  and  to  the  robust  PCA  of  Ke  and  Kanade  and  is  competitive  in  computing  time  with  these 
PCAs.  While  LI  MCDA  is  not  competitive  in  computing  time  with  standard  PCA,  it  is  always  far  superior  in  accuracy  except  for 
a  Gaussian-distributed  point  cloud.  The  theoretical  framework  for  2D  is  generalizable  to  higher  dimensions  for  general  pattern 
recognition  and  the  extension  to  3D  is  currently  underway.  The  local-parabola-fit-based  algorithm  of  2D  was  generalized  to  3D 
but  did  not  produce  equivalently  good  results  in  3D.  Algorithms  based  on  local  medians  and  local  linear  fits  have  been 
investigates  but  they  too  have  not  yielded  convergence  in  3D  similar  to  what  was  observed  in  2D.  The  causes  of  this  situation 
are  under  investigation  and  this  investigation  will  continue  in  a  follow-on  project. 

Lp  Averaging  with  0  <  p  <  1:  This  topic  was  not  foreseen  in  the  original  proposal  but  it  turns  out  that  it  leads  to  a  natural 
extension  of  LI  splines  and  LI  MCDA,  so  it  was  investigated  in  preparation  for  that  work.  We  generated  evidence  that  one  can 
calculate  generically  combinatorially  expensive  Lp  and  Ip  averages,  0  <  p  <  1,  in  polynomial  time  by  restricting  the  data  to  come 
from  a  wide  class  of  statistical  distributions.  Our  approach  differs  from  the  approaches  in  the  previous  literature,  which  are 
based  on  a  priori  sparsity  requirements  or  on  accepting  a  local  minimum  as  a  replacement  for  a  global  minimum.  The 
functionals  by  which  Lp  averages  are  calculated  are  not  convex  but  are  radially  monotonic  and  the  functionals  by  which  Ip 
averages  are  calculated  are  nearly  so,  which  are  the  keys  to  solvability  in  polynomial  time.  Analytical  results  for  symmetric, 
radially  monotonic  univariate  distributions  were  created.  An  algorithm  for  univariate  Ip  averaging  was  also  created. 
Computational  results  for  a  Gaussian  distribution,  a  class  of  symmetric  heavy-tailed  distributions  and  a  class  of  asymmetric 
heavy-tailed  distributions  are  presented.  Many  phenomena  in  human-based  areas  are  increasingly  known  to  be  represented  by 
data  that  have  large  numbers  of  outliers  and  belong  to  very  heavy-tailed  distributions.  When  tails  of  distributions  are  so  heavy 
that  even  medians  (LI  and  II  averages)  do  not  exist,  one  needs  to  consider  using  Ip  minimization  principles  with  0  <  p  <  1 . 
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